Back

Genetics in Medicine

Elsevier BV

Preprints posted in the last 7 days, ranked by how well they match Genetics in Medicine's content profile, based on 69 papers previously published here. The average preprint has a 0.09% match score for this journal, so anything above that is already an above-average fit.

1
Determinants of DNA-sequence-based Diagnostic Yield in the CSER Consortium

Mavura, Y.; Crosslin, D.; Ferar, K. D.; Lawlor, J. M.; Greally, J. M.; Hindorff, L.; Jarvik, G. P.; Kalla, S.; Koenig, B. A.; Kvale, M.; Kwok, P.-Y.; Norton, M.; Plon, S. E.; Powell, B. C.; Slavotinek, A.; Thompson, M. L.; Popejoy, A. B.; Kenny, E. E.; Risch, N.

2026-04-22 genetic and genomic medicine 10.64898/2026.04.20.26351140 medRxiv
Top 0.1%
52.9%
Show abstract

PurposeDiagnostic yield from exome and genome sequencing varies widely across studies. It remains unclear how much of this variation reflects patient-level factors (e.g., sex, clinical features, race/ethnicity, genetic ancestry) versus site-level practices such as sequencing modality or variant interpretation workflows. We aimed to quantify the contributions of these factors to diagnostic outcomes across five U.S. clinical sequencing sites. MethodsWe performed a cross-sectional analysis of 3,008 prenatal, neonatal, and pediatric cases from the NHGRI Clinical Sequencing Evidence-Generating Research (CSER) consortium (2017-2023). Clinical indications spanned neurodevelopmental, neurological, immunological, metabolic, craniofacial, skeletal, cardiac, prenatal, and oncologic presentations. Genetic ancestry was inferred from sequencing data, and variants were interpreted using ACMG/AMP guidelines to classify DNA-based diagnoses. Generalized linear mixed models were used to estimate associations between diagnostic yield and fixed effects (sex, prenatal status, isolated cancer, number of clinical indications, sequencing modality, race/ethnicity, and genetic ancestry), while modeling study site as a random effect to quantify between-site variation. ResultsThe overall diagnostic yield was 19.0%. Multiple clinical indications (OR=1.47, 95% CI 1.20-1.80, p<0.001) were associated with higher diagnostic yield, and male sex (OR=0.80, 95% CI 0.66-0.96, p=0.017) and prenatal status (OR=0.63, 95% CI 0.44-0.90, p=0.012) were associated with lower yield. Sequencing modality, race/ethnicity, genetic ancestry, and isolated cancer were not statistically significantly associated with diagnostic outcomes.. A model without fixed effects attributed [~]10% of variance in diagnostic yield to between-site differences. After adjusting for covariates, site-level variance decreased to 5.7%, indicating consistent variation across sites not explained by measured patient factors. ConclusionAcross five sites, patient-level clinical features influenced diagnostic yield, but substantial site-level variation remained even after adjustment. Differences in variant interpretation, or case-classification practices may contribute to this residual variability. Further efforts to increase consistency in exome- and genome-sequencing diagnostic workflows may help reduce inter-site differences.

2
Investigating Uptake and Impact of Genetic and Genomic Evaluation Following a Perinatal Demise

Mossler, K.; D'Orazio, E.; Hall, K.; Osann, K.; Kimonis, V.; Quintero-Rivera, F.

2026-04-23 genetic and genomic medicine 10.64898/2026.04.22.26347546 medRxiv
Top 0.1%
33.8%
Show abstract

Objective The decline of the perinatal demise rate is slowing and demises are often unexplained. Significant research has been done regarding diagnostic yield and genetic causes of demise, but little is known about how Geneticist involvement impacts outcomes. The goal of the study was to evaluate post-mortem genetic testing practices and effects of the geneticists involvement. Methods Retrospective data from 111 perinatal demise cases was examined, including rates of prenatal genetic counseling, post-delivery genetics consult, genetic testing, and autopsy investigation. Results In this cohort 54% received genetic testing and 25% received a genetics consult. When compared to those without, cases with genetic specialist involvement were associated with significant increases in testing uptake (p=0.007), diagnostic yield (p<0.001), and patient education (p<0.001). Second trimester stillbirths and those with fewer ultrasound (US) abnormalities were less likely to receive genetic testing (both p values <0.001) and consults (p<0.001, p=0.020). Conclusion Though it was not possible to avoid ascertainment bias, this data demonstrates that geneticist involvement correlates with a higher rate of testing, greater diagnostic yield, and more thorough counseling. These findings underscore the importance of integrating genetics providers into perinatal postmortem healthcare teams.

3
Novel Genetic Risk Loci for Pancreatic Ductal Adenocarcinoma Identified in a Genome-wide Study of African Ancestry Individuals

Vergara, C.; Ni, Z.; Zhong, J.; McKean, D.; Connelly, K. E.; Antwi, S. O.; Arslan, A. A.; Bracci, P. M.; Du, M.; Gallinger, S.; Genkinger, J.; Haiman, C. A.; Hassan, M.; Hung, R. J.; Huff, C.; Kooperberg, C.; Kastrinos, F.; LeMarchand, L.; Lee, W.; Lynch, S. M.; Moore, S. C.; Oberg, A. L.; Park, M. A.; Permuth, J. B.; Risch, H. A.; Scheet, P.; Schwartz, A.; Shu, X.-O.; Stolzenberg-Solomon, R. Z.; Wolpin, B. M.; Zheng, W.; Albanes, D.; Andreotti, G.; Bamlet, W. R.; Beane-Freeman, L.; Berndt, S. I.; Brennan, P.; Buring, J. E.; Cabrera-Castro, N.; Campa, D.; Canzian, F.; Chanock, S. J.; Chen, Y.;

2026-04-22 genetic and genomic medicine 10.64898/2026.04.21.26351329 medRxiv
Top 0.2%
9.1%
Show abstract

Pancreatic cancer disproportionately affects Black individuals in the United States, but they have limited representation in genetic studies of pancreatic ductal adenocarcinoma (PDAC). To address this gap, we performed admixture mapping and genome-wide association analysis (GWAS) in genetically inferred African ancestry individuals (1,030 cases and 889 controls). Admixture mapping identified three regions with a significantly higher proportion of African ancestry in cases compared to controls (5q33.3, 10p1, 22q12.3). GWAS identified a genome-wide significant association at 5p15.33 (CLPTM1L, rs383009:T>C, T Allele Frequency=0.51, OR:1.45, P value=1.24x10-8), a locus previously associated with PDAC. Known loci at 5p15.33, 7q32.3, 8q24.21 and 7q25.1 also replicated (P value <0.01). Multi-ancestral fine-mapping identified two potential causal SNPs (rs3830069 and rs2735940) at 5p15.33. Collectively these findings identified novel PDAC risk loci and expanded our understanding of this deadly cancer in underrepresented populations, emphasizing the multifactorial nature of PDAC risk including inherited genetic and non-genetic factors. Statement of SignificanceTo understand how genetic variation contributes to PDAC risk in Black people in North American, we studied individuals of genetically-inferred African ancestry. We identified novel risk loci and differences in the contribution of known loci. This demonstrates that ancestry-informed genetic analyses improve our understanding of PDAC risk and enhances discovery.

4
Biventricular cardiac dynamic shape: genetics and cardiometabolic disease associations

Burns, R.; Young, W. J.; Uddin, K.; Petersen, S. E.; Ramirez, J.; Young, A. A.; Munroe, P. B.

2026-04-20 genetic and genomic medicine 10.64898/2026.04.19.26350940 medRxiv
Top 0.4%
4.1%
Show abstract

BackgroundGenetic studies using cardiac magnetic resonance (CMR) imaging have identified loci related to cardiac shape, but most focus on static morphology. The value of a dynamic cardiac shape atlas capturing both shape and function remains unknown. MethodsA dynamic shape atlas comprising CMR-derived shape models at end-diastole and end-systole was combined with genetic and outcome data in 36,992 UK Biobank participants. Dynamic shape principal components (PCs) describing >1% of variance were characterized, and tested for associations with prevalent and incident cardiometabolic diseases, including ischemic heart disease (IHD), heart failure (HF), significant atrioventricular block (AVB), and atrial fibrillation (AF), and independent predictive power alongside standard CMR measures. Genome-wide association studies (GWAS) were performed to identify candidate genes and biological pathways, and polygenic risk scores (PRS) were assessed for disease associations. Mendelian randomization (MR) was performed to test causality of observed disease associations. ResultsWe identified 14 dynamic cardiac shape PCs capturing 83.3% of total dynamic cardiac shape variance. These PCs captured distinct functional remodeling patterns such as variation in annular plane systolic excursion, while remaining only modestly correlated with standard CMR measures. All 14 PCs were associated with at least one incident cardiometabolic disease, with the strongest associations observed for incident IHD, HF, and AVB. Notably, incorporating dynamic shape PCs improved the prediction of incident IHD beyond standard CMR measures. GWAS identified 75 genetic loci associated with dynamic shape, including 14 variants previously unreported for cardiac traits, and candidate genes demonstrated enrichment in pathways related to cardiac development and contractile function. PRS derived from dynamic shape loci were significantly associated with multiple outcomes, most prominently HF. MR identified significant causal relationships between several PCs and cardiometabolic disease. ConclusionsDynamic cardiac shape features capture aspects of cardiac structure and function not fully represented by standard CMR measures. These features are strongly associated with incident cardiometabolic disease and provide new insights into the genetic architecture of cardiac remodeling. Clinical perspectiveO_ST_ABSWhat is new?C_ST_ABSO_LIGenetic and outcome relationships with a dynamic statistical shape model capturing both left and right ventricles at end-diastole and end-systole. C_LIO_LIDemonstration of incremental value over existing cardiac shape models, through capture of functional remodeling not represented by standard imaging measures. C_LIO_LIIdentification of genetic susceptibility loci for dynamic cardiac shape, including 14 variants not previously reported for cardiac traits. C_LI What are the clinical implications?O_LIThe results enhance our understanding of the genetic architecture of dynamic cardiac shape and function in the general population and clarify their relationships with other cardiovascular endophenotypes and incident cardiometabolic diseases. C_LIO_LINewly identified candidate genes expand the biological pathways implicated in cardiac remodeling and provide targets for future functional and mechanistic studies. C_LIO_LIThe improved prediction of incident cardiometabolic disease, particularly ischemic heart disease, achieved by adding dynamic shape PCs to traditional CMR measures suggests potential value for their inclusion in evaluation of patients. C_LI

5
Duplication within 14q32.13 implicates a chimeric CLMN::SYNE3 RNA transcript in cerebellar ataxia

Litster, T. M.; Wilcox, R. A.; Carroll, R.; Gardner, A. E.; Nazri, N. M.; Shoubridge, C. A.; Delatycki, M. B.; Lohmann, K.; Agzarian, M.; Turella Divani, R.; Rafehi, H.; Scott, L.; Monahan, G.; Lamont, P. J.; Ashton, C.; Laing, N. G.; Ravenscroft, G.; Bahlo, M.; Haan, E.; Lockhart, P. J.; Friend, K. L.; Corbett, M. A.; Gecz, J.

2026-04-24 genetic and genomic medicine 10.64898/2026.04.23.26350376 medRxiv
Top 0.4%
3.9%
Show abstract

The spinocerebellar ataxias (SCAs) are a clinically heterogenous group of neurodegenerative disorders that affect movement, vision, speech and balance. Here, we reassign the linkage of SCA30 to 14q32.13 based on a cumulative LOD score >12. Within this interval we identified a 331 kb duplication, absent in population controls and not observed in >800 unrelated individuals with genetically unresolved cerebellar ataxia. RNASeq analysis of patient-derived lymphoblastoid cell lines revealed a splice-mediated chimeric transcript resulting from the duplication event. This transcript joined exon 1 of CLMN to exon 2 of SYNE3. In silico translation predicted that this chimeric transcript would produce a short N-terminal peptide corresponding to exon 1 of CLMN and the usually untranslated region of exon 2 of SYNE3 fused to the complete and in-frame SYNE3 protein. Transient overexpression of SYNE3 or the CLMN::SYNE3 fusion protein, in both HeLa cells and mouse primary cortical neurons, resulted in equivalent cellular outcomes including altered nuclear morphology and chromosomal DNA fragmentation. SYNE3 forms part of the linker of nucleoskeleton and cytoskeleton complex and is not usually expressed in cerebellar Purkyn[e] neurons while, CLMN has a Purkyn[e] specific expression pattern within the brain. Our data suggests that ectopic expression of SYNE3 in cerebellar Purkyn[e] neurons, mediated by the CLMN promoter, leads to cerebellar atrophy and causes spinocerebellar ataxia in the SCA30 family. This is an example of Mendelian disease arising from a novel, chimeric transcript with a likely dominant negative effect. Chimeric transcripts are commonly associated with cancers, but they are not often associated with monogenic disorders. Detection of chimeric transcripts as part of structural variant analysis could increase the genetic diagnostic yield of Mendelian disorders.

6
X-Chromosome-Wide Association Study Identifies Novel Genetic Signals for Heart Failure and Subtypes

Ren, J.; VA Million Veteran Program, ; Liu, C.; Hui, Q.; Rahafrooz, M.; Kosik, N. M.; Urak, K.; Moser, J.; Muralidhar, S.; Pereira, A.; Cho, K.; Gaziano, J. M.; Wilson, P. W. F.; Million Veteran Program, V.; Phillips, L. S.; Sun, Y.; Joseph, J.

2026-04-23 genetic and genomic medicine 10.64898/2026.04.21.26351435 medRxiv
Top 0.4%
3.6%
Show abstract

Background: Heart failure (HF) is a major and growing public health problem, and prior studies support a meaningful genetic contribution to HF susceptibility. Clinically, HF is commonly categorized into the major clinical sub-types of HF with reduced ejection fraction (HFrEF) and HF with preserved ejection fraction (HFpEF), which differ in pathophysiology and clinical profiles. However, previous genome-wide association studies have focused on autosomal variation and have routinely excluded the X chromosome, leaving X-linked genetic contributions to HF and its subtypes under-characterized. Methods: We performed X-chromosome wide association study (XWAS) utilizing directly genotyped data from 590,568 Million Veteran Program participants, including 90,694 HF cases across European, African, Hispanic, and Asian Americans. Sex- and ancestry-stratified logistic regression was used with XWAS quality control measures, adjusting for age and population structure, followed by fixed-effects multi-ancestry meta-analysis. Functional annotation, gene-based testing, fine-mapping, and colocalization were performed. We replicated genetic associations with all-cause HF in the UK Biobank. Results: In the multi-ancestry meta-analysis, we identified five X-chromosome-wide significant loci for all-cause HF, five for HFrEF, and one locus for HFpEF in males. No loci reached significance in female-specific analyses. In sex-combined analyses, we identified six loci for all-cause HF and four for HFrEF. The strongest and most emphasized signals mapped to genes were BRWD3, FHL1, and CHRDL1. Ancestry-specific analyses revealed additional loci, including NDP and WDR44 in African ancestry and PHF8 in Hispanic ancestry. One locus, BRWD3, was replicated in UK Biobank HF cohort. Integrated post-GWAS analyses (fine-mapping, colocalization and pleiotropy trait association studies) reinforced the biological plausibility of the X-linked signals. Conclusions: This multi-ancestry, sex-stratified XWAS identifies X-linked genetic contributions to HF and its subtypes and highlights the role of X-chromosome in heart failure pathogenesis.

7
Comprehensive Exome Sequencing in Swedish Patients with Spontaneous Coronary Artery Dissection

Gunnarsson, C.; Ellegard, R.; Ahsberg, J.; huda, s.; Andersson, J.; Dworeck, C. F.; Glaser, N.; Erlinge, D.; Loghman, H.; Johnston, N.; Mannila, M.; Pagonis, C.; Ravn-Fischer, A.; Rydberg, E.; Welen Schef, K.; Tornvall, P.; Sederholm Lawesson, S.; Swahn, E. E.

2026-04-24 genetic and genomic medicine 10.64898/2026.04.22.26351535 medRxiv
Top 0.5%
3.0%
Show abstract

Abstract Background Spontaneous coronary artery dissection (SCAD) is a well-recognised cause of acute coronary syndrome particularly among women without conventional cardiovascular risk factors. Increasing evidence indicates a genetic contribution; however, the underlying genetic architecture of SCAD remains insufficiently understood. Objective The aim of this study was to assess the prevalence of rare variants in previously reported SCAD associated genes and to explore the potential presence of novel genetic alterations in well-characterised Swedish patients with SCAD. Methods The study comprised 201 patients enrolled in SweSCAD, a national project examining the clinical characteristics, aetiology, and outcomes of SCAD. All individuals had a confirmed diagnosis based on invasive coronary angiography. Comprehensive exome sequencing was performed to identify rare variants contributing to disease susceptibility. Results Genetic variants that have been associated with SCAD according to current clinical genetics practice for variant reporting were identified in approximately 4 % of patients. In addition, rare potentially relevant variants were detected in almost 60 % of patients in genes associated with vascular integrity and vascular remodelling. Conclusion This study supports SCAD as a genetically complex arteriopathy, driven by rare high?impact variants together with broader polygenic susceptibility. Variants in collagen, vascular extracellular matrix, and oestrogen?responsive pathways provide biologically plausible links to female?predominant disease. Although the diagnostic yield of clearly actionable variants is modest, these findings support broader genomic evaluation beyond overt syndromic presentations and highlight the need for larger integrative genomic and functional studies to refine risk stratification and management.

8
De novo EHMT2 variants cause an autosomal dominant EHMT2-related Kleefstra syndrome via loss of G9a methyltransferase activity.

Hnizda, A.; Martinez-Delgado, B.; Sanchez-Ponce, D.; Alonso, J.; Amiel, J.; Attie-Bitach, T.; Bada-Navarro, A.; Baladron, B.; Bermejo-Sanchez, E.; Brinsa, V.; Bukova, I.; Cazorla-Calleja, R.; Cervenkova, S.; Chow, S.; Dusek, P.; Fedosieieva, O.; Fernandez-Prieto, M.; Ghosh, S.; Gomez-Mariano, G.; Gregorova, A.; Hamilton, M. J.; Hartmannova, H.; Hernandez-San Miguel, E.; Herrero-Matesanz, M.; Hodanova, K.; Kadek, A.; Kerkhof, J.; Kleefstra, T.; Lacombe, D.; Levy, M. A.; Lopez-Martin, E.; Lyse, R.; Man, P.; Marin-Reina, P.; Macnamara, E. F.; McConkey, H.; Melenovska, P.; Mielu, L. M.; Moore, D.;

2026-04-20 genetics 10.1101/2025.09.25.678439 medRxiv
Top 0.5%
2.5%
Show abstract

EHMT1 and EHMT2 genes encode human euchromatin histone lysine methyltransferase 1 and 2 (EHMT1 alias GLP; EHMT2 alias G9a) that form heteromeric GLP/G9a complexes with essential roles in epigenetic regulation of gene expression. While EHMT1 haploinsufficiency has been established as the cause of Kleefstra syndrome 1, the pathogenesis of G9a dysfunction in human disease remains largely unknown. We identified seven de novo EHMT2 variants in patients with clinical presentation, episignatures, histone modifications and transcriptomic profiles similar to those of Kleefstra syndrome 1. In vitro studies revealed that these variants encode for structurally stable G9a proteins that are catalytically incompetent due to aberrant interactions either with histone H3 tail or with S-adenosylmethionine. Heterozygous mice carrying a patient-derived variant exhibited growth retardation, facial/skull dysmorphia and aberrant behavior. Here we report pathogenic EHMT2 variants that likely exert dominant-negative effect on GLP/G9a complexes and thus genocopy the EHMT1 haploinsufficiency via a distinct molecular mechanism, defining an autosomal dominant EHMT2-related Kleefstra syndrome.

9
PALM3 and hearing loss: a potential dual diagnosis interfering with novel gene discovery

Najarzadeh Torbati, P.; Hallbrucker, L.; Hofrichter, M. A. H.; Owrang, D.; Setzke, J.; Kilimann, M. W.; Hemmatpour, A.; Rajati, M.; Ghayoor Karimiani, E.; Haaf, T.; Vogl, C.; Vona, B.

2026-04-21 genetic and genomic medicine 10.64898/2026.04.20.26351093 medRxiv
Top 0.5%
2.3%
Show abstract

Hereditary hearing loss is highly genetically heterogeneous, with emerging overlap between genes implicated in early-onset and age-related hearing loss. We report a consanguineous family with autosomal recessive, non-syndromic hearing loss in which the proband harbors a homozygous splice-site variant in PALM3 (NM_001145028.2:c.314+1G>A) and a homozygous missense variant in OTOA. A minigene assay for the PALM3 variant demonstrated aberrant splicing with exon skipping, resulting in a frameshift and a large inframe deletion, both consistent with loss of function and impacting all known transcripts. While the organ of Corti from 12-month-old heterozygous Palm3 mice showed preserved overall architecture, published Palm3 knockout mice exhibit auditory dysfunction, supporting an auditory phenotype with loss of function. Although a dual molecular diagnosis cannot be excluded, the combined genetic, functional, and comparative data support PALM3 as a strong candidate gene for autosomal recessive hearing loss.

10
Functional Connectivity of the Neonatal Cerebellum is Impacted by Sex and Polygenic Liability for Autism

Wagner, L.; Chiem, E.; Liu, J.; Hernandez, L. M.

2026-04-19 genetic and genomic medicine 10.64898/2026.04.17.26351076 medRxiv
Top 0.7%
1.5%
Show abstract

The cerebellum rapidly integrates with cerebral networks during infancy and shows consistent structural and functional alterations in Autism Spectrum Disorder (ASD), suggesting that early cerebellar development may be consequential for later behavioral and psychiatric outcomes. Yet, little is known about the effect of ASD genetic liability on cerebello-cerebral functional connectivity in infancy or whether effects may differ by biological sex. Here, we leveraged neonatal functional magnetic resonance imaging, genetic, and behavioral follow-up data from the Developing Human Connectome Project (dHCP) to examine the relationship between ASD polygenic scores (PGS) and functional connectivity of cerebellar regions associated with sensorimotor and social-cognitive functions in 198 term-born neonates (mean age: 9.7 days). We report widespread sex differences in neonatal cerebello-cerebral connectivity that are regionally specific across cerebellar subdivisions. Across the full sample, elevated ASD PGS predicted alterations in cerebello-cerebral connectivity, with hemisphere-dependent differences in sensorimotor cerebellar connectivity with temporal cortex, and hyperconnectivity between the right social-cognitive seed and posterior cingulate. Notably, elevated ASD PGS predicted opposing patterns of cerebello-cerebral connectivity in males and females, including male hyperconnectivity between the right sensorimotor cerebellum and default mode areas, and female hyperconnectivity between the right social-cognitive seed and sensorimotor cortex. Connectivity associated with elevated ASD PGS showed nominal, sex-specific associations with 18-month language ability, attention problems, and emotional reactivity. Our findings show that ASD PGS influences the functional configuration of the cerebellum at birth and suggest that underlying cerebellar connectivity profiles associated with ASD may partially underlie distinct behavioral presentations in males and females.

11
Linguistic Validation of the Rett Syndrome Behavior Questionnaire Spanish Translation: a Two-Stage Caregiver Study Across Latin America

Polo Sanchez, M.; Lesmes, A. C.; Muni, N.; Vigneault, F.; Novak, R.

2026-04-23 neurology 10.64898/2026.04.16.26349544 medRxiv
Top 0.8%
0.9%
Show abstract

Background: Rett Syndrome (RTT) is a severe neurodevelopmental disorder affecting approximately 1 in 10,000 live female births worldwide. The Rett Syndrome Behaviour Questionnaire (RSBQ), remains one of the most widely used standardized behavioral assessment tools for RTT. However, the RSBQ was originally validated only in British English, limiting its applicability for Spanish-speaking caregivers and clinical centers across Latin America and Spain. Objective: The primary aim of this study was to develop and validate the comprehension of the Spanish translation of the RSBQ to ensure cultural and linguistic equivalence, enhance data reliability, and facilitate earlier, more accurate clinical assessments among Spanish-speaking RTT populations. Methods: Surveys were administered in two phases to Spanish-speaking caregivers between November 2023 and September 2025. Phase I consisted of 12 guided survey administrations with participants being able to ask clarifying questions and offer linguistic modifications of RSBQ questions. Phase II consisted of independent online administration of the refined Spanish RSBQ and a retest at least 7 days later. Participants were recruited through direct outreach and supported virtually during questionnaire completion. Results: Following data cleaning and quality control, a total of 51 caregivers successfully completed both surveys. The Spanish RSBQ demonstrated high caregiver comprehension and strong engagement across multiple Latin American countries, including Argentina, Mexico, and Peru. Responses were highly correlated between test and retest timepoints, and no question showed biased response distributions. A slight effect of response interval on test-retest correlation was observed, potentially indicating the impact of natural disease progression confounding retest evaluation for long (>80 day) intervals; however this effect did not impact the overall linguistic validation results as analysis of only <21 day test-retest responders confirmed the findings. Conclusions: This linguistic validation study represents the first formal step toward the clinical validation of the Spanish RSBQ, enabling broader inclusion of Spanish-speaking populations in RTT research. The collaborative, bilingual data collection strategy proved both feasible and effective, paving the way for multinational trials and expanding therapeutic accessibility through localized, patient-centered innovation.

12
Transcriptome-Wide Alternative Splicing Analysis Implicates Complex Events in Bipolar Disorder

Martinez-Jimenez, M.; Garcia-Ortiz, I.; Romero-Miguel, D.; Kavanagh, T.; Marshall, L. L.; Bello Sousa, R. A.; Sanchez Alonso, S.; Alvarez Garcia, R.; Benavente Lopez, S.; Di Stasio, E.; Schofield, P. R.; Baca-Garcia, E.; Mitchell, P. B.; Cooper, A. A.; Fullerton, J. M.; Toma, C.

2026-04-21 genetic and genomic medicine 10.64898/2026.04.19.26351209 medRxiv
Top 0.9%
0.8%
Show abstract

Alternative-splicing events (ASE) increase transcriptomic variability and play key roles in biological functions. The contribution of ASE to bipolar disorder (BD) remains largely unexplored. We performed a Transcriptome-Wide Alternative-Splicing Analysis (TWASA) to identify ASEs and genes potentially involved in BD. The study comprised 635 individuals: a discovery sample (DS) of 31 individuals from eight multiplex BD families (16 BD cases; 15 unaffected relatives), and a replication sample (RS) of 604 subjects (372 BD cases; 232 controls). Sequencing was conducted on RNA from lymphoblastoid cell lines (DS) and whole blood (RS). TWASA was performed using VAST-TOOLS (VT), rMATS (RM), and MAJIQ/MOCCASIN (MCC). Gene-set association analyses of genes containing ASEs were performed across six psychiatric disorders. Novel ASE (nASE) were investigated in the DS using FRASER. Limited gene overlap was observed across TWASA tools. MCC identified 2,031 complex ASEs involving 1,508 genes, showing the strongest genetic association with BD across psychiatric phenotypes. Prioritization of MCC-identified ASE genes yielded 441 candidates, including DOCK2 as top candidate from the DS. Replication was obtained for 98 genes, five with an identical ASE, and four (RBM26, QKI, ANKRD36, and TATDN2) showing a concordant percentage-spliced-in direction with the DS. Finally, 578 nASE were identified in the DS, with no evidence of familial segregation or differences in ASE types. This first TWASA in BD reveals tool-specific variability, complex ASE for genes specifically associated with BD, and novel candidate genes for BD. Alternative transcript isoform abundance may represent a mechanism contributing to BD pathophysiology.

13
Beyond Seizure Burden: Seizure Semiology, but not Frequency, Is Associated With Caregiver-Reported Autistic Behaviors in SYNGAP1-DEE

Kiwull, L.; Schmeder, V.; Zenker, M.; Mengual Hinojosa, M.; Perkins, J. R.; Ranea, J.; Kluger, G.; Hartlieb, T.; Pringsheim, M.; von Stuelpnagel, C.; Weghuber, D.; Eschermann, K.

2026-04-21 neurology 10.64898/2026.04.19.26351217 medRxiv
Top 0.9%
0.8%
Show abstract

1.PurposeSYNGAP1-related developmental and epileptic encephalopathy (SYNGAP1-DEE) is characterized by high rates of both epilepsy and autism spectrum disorder (ASD). While the clinical spectrum is well-documented, the link between specific seizure semiologies and caregiver-reported autistic behaviors is not well understood. This study analyzed the correlation between ten distinct seizure types, their frequencies, and a caregiver-reported autistic behavior score. MethodClinical data were extracted from the PATRE (PATient-based phenotyping and evaluation of therapy for Rare Epilepsies) Registry for SYNGAP1, in the framework of the EURAS project (Grant No. 101080580, Horizon Europe). This study employed a retrospective cross-sectional analysis of caregiver-reported registry data. Analysis was restricted to an analytic cohort of N=337 participants with complete data for both the epilepsy questionnaire (including epilepsy status, seizure semiology, and peak seizure frequency items) and the behavior questionnaire (from a total N=522 registry participants). Caregiver-reported autistic behaviors were quantified using a standardized caregiver-reported scale (Likert 1-5). Statistical associations were evaluated using the Wilcoxon rank-sum test to compare caregiver-reported autistic behavior scores across different seizure semiologies and Spearmans rank correlation to assess the impact of seizure frequency (9-point scale). ResultsWithin the analytic cohort (N=337), epilepsy was reported in 259 patients. Eyelid myoclonia was the most prevalent semiology, affecting 64.9% (n=168) of the epilepsy-positive group. Atypical absences (n=77) demonstrated the most profound and statistically robust association with higher caregiver-reported autistic behavior scores (FDR-adjusted p = 0.001). Significant associations were also observed for typical absences (n=70, FDR-adjusted p = 0.018), eyelid myoclonia (FDR-adjusted p = 0.018), myoclonic-atonic seizures (n=40, FDR-adjusted p = 0.019), and atonic seizures (n=72, FDR-adjusted p = 0.025). Focal and tonic-clonic seizures showed weaker associations (FDR-adjusted p = 0.026 and p = 0.047, respectively). Crucially, quantitative analysis revealed no significant correlation between ordinal caregiver-reported peak seizure frequency ratings and caregiver-reported autistic behavior scores across all semiologies (e.g., Eyelid Myoclonia: p=0.096; Atypical Absences: p=0.744), indicating no detectable association between peak-frequency ratings and caregiver-reported autistic behavior scores. ConclusionHigher caregiver-reported autistic behavior scores in SYNGAP1-DEE were most strongly associated with the presence of atypical absences, representing a generalized, thalamocortical seizure network dysfunction. In contrast, no detectable association was observed between caregiver-reported autistic behavior scores and the ordinal caregiver-reported peak seizure frequency metric. Atypical absences and related semiologies may serve as clinical "red flags" for increased neurodevelopmental comorbidity severity, regardless of reported peak seizure frequency. Abstract SummaryThis study investigates the relationship between ten seizure semiologies, seizure frequency, and severity of caregiver-reported autistic behaviors in a large-scale international cohort of N=337 patients with SYNGAP1-DEE. We identify a robust association between elevated caregiverreported autistic behavior scores and specific thalamocortical seizure patterns, most prominently atypical absences. Notably, our analysis reveals that this association is independent of seizure frequency, demonstrating no detectable association between this ordinal, caregiver-reported seizure frequency metric and caregiver-reported autistic behaviors.

14
The burden of neurogenic orthostatic hypotension in patients with multiple system atrophy: a real-world study

Kmiecik, M. J.; O'Brien, L.; Szpyhulsky, M.; Iodice, V.; Freeman, R.; Jordan, J.; Biaggioni, I.; Kaufmann, H.; Vickery, R.; Miller, A.; Saunders, E.; Rushton, E.; Valle, L.; Norcliffe-Kaufmann, L.

2026-04-22 neurology 10.64898/2026.04.20.26351214 medRxiv
Top 0.9%
0.8%
Show abstract

BackgroundAlthough neurogenic orthostatic hypotension (nOH) is a common and debilitating feature of multiple system atrophy (MSA), little is known about the burden of symptoms in the real world. ObjectivesTo design and conduct a cross-sectional community-based research survey targeting patients with MSA, with and without nOH. MethodsWe recruited patients with MSA to complete an anonymous online survey covering three core themes: 1) timely diagnosis, 2) nOH pharmacotherapy and refractory symptoms, and 3) confidence in physician knowledge. Responses were grouped by pre-specified diagnostic certainty levels. Relationships between symptoms, function, and pharmacotherapy were assessed using univariate and multivariate methods. ResultsWe analyzed 259 respondents with a self-reported diagnosis of MSA (age: M=64.38, SD=8.09 years; 44% female). In total, 42% also had a diagnosis nOH; 40% had symptoms highly suspicious of nOH, but no diagnosis; and 21% reported having never had their blood pressure measured in the standing position at a clinical visit. Treatment with a pressor agent was independently associated with the presence of other symptoms of autonomic failure. Each additional nOH symptom reported increased the odds of requiring pharmacotherapy by 18%. Yet, despite anti-hypotensive medication use, 97% of patients reported limitations in their ability to bathe, cook, or arise from a chair/bed with 76% needing caregiver support for refractory nOH symptoms. ConclusionsThis cross-sectional representative sample shows nOH is underrecognized and undertreated in MSA patients, leading to substantial functional limitations. It is our hope that these findings are leveraged for planning future trials and advocating for better treatments.

15
Quantitative and qualitative patient-reported analysis of misdiagnosis and/or late diagnosis of metastatic lobular cancer

Cody, M. E.; Chang, H.-C.; Foldi, J.; Jankowitz, R. C.; Balic, M.; Cushing, T.; Donnelly, C.; Freeney, S.; Levine, J.; Petitti, L.; Ryan, N.; Spencer, K.; Turner, C.; Tseng, G. C.; Desmedt, C.; Oesterreich, S.; Lee, A. V.

2026-04-20 oncology 10.64898/2026.04.16.26348799 medRxiv
Top 1.0%
0.8%
Show abstract

BackgroundInvasive lobular breast cancer (ILC) is the most commonly diagnosed special histological subtype of breast cancer (BC). Metastatic ILC (mILC) is less sensitive to FDG-PET imaging and often metastasizes to unusual sites --peritoneum, gastrointestinal (GI) tract, ovaries, urinary tract, and orbit--which may go unrecognized after a long disease-free interval. Some metastatic sites cause nonspecific symptoms, like abdominal/epigastric pain, with numerous published case reports of mILC misdiagnosed as gastric cancer. These atypical BC metastatic sites may lead to late and/or misdiagnosis, thereby delaying effective treatments. ObjectiveWe developed a patient survey to investigate the patient-reported prevalence of delayed diagnosis or misdiagnosis of mILC and their potential impact upon treatment outcomes. MethodsA 45-question survey was developed and piloted with breast cancer researchers, clinical oncologists, and patient advocates. This IRB-approved survey was then distributed to patients with ILC. Analyses including data QC and visualization were conducted in R using descriptive statistics. Incomplete or inconsistent responses were excluded, and summary statistics were stratified by four common mILC sites to highlight subgroup differences. Results525 patient surveys were completed, with 450 patients diagnosed with ILC, and of those 321 diagnosed with mILC. For those with mILC, 33.3% (n=107) were diagnosed with de novo mILC at initial presentation. Of the patients diagnosed with mILC, 32.1% (n=103) presented with other medical conditions at diagnosis. Misdiagnosis was reported by 26.2% (n=84) of patients with mILC, and of these cases, 31% (n=26) had [&ge;]2 misdiagnoses. The top 5 misdiagnoses were bone-related condition (24.7%), benign breast condition (23.4%), another type of BC (7.8%), diagnostic delay (7.8%), and menopause related (5.2%). 44.5% of patients waited [&ge;]1 year for an accurate diagnosis. 49 patients were treated for their misdiagnosis, and 6 received incorrect cancer treatments. The most frequently reported contributors to delayed or misdiagnosis were inconclusive imaging, providers lack of ILC knowledge, and initial misdiagnosis. Of the 321 patients with mILC, 138 (42.9%) reported symptoms before diagnosis; the most common were back pain (16.5%), fatigue/malaise (14.9%), GI symptoms (11.8%), bloating (8.4%), and weight loss (8.1%). Although 40% of patients reported having a mammogram at the time of their initial misdiagnosis, ILC was detected in only 20.5% (24/116) of these cases, and mammography detected only 5 (25%) of the 20 de novo mILC cases. Patients reported additional diagnostic testing within 1-3 months of their initial mammogram, includingbiopsy, ultrasound (US), and MRI. 47.9% of patients were in active BC surveillance after curative intent therapy at the time of their mILC diagnosis; however, no statistical difference was seen in time to diagnosis versus those patients not under surveillance. ConclusionOur survey results underscore the urgent need to improve diagnostic strategies for mILC. Addressing delays and diagnostic errors in mILC is critical to optimizing treatment strategies and improving patient outcomes.

16
Assessing Parent-cocreated Sensory Reactivity Outcomes in Children with Neurodevelopmental Disorders Undergoing Bumetanide Treatment: A Multiple-Baseline Single-Case Experimental Design

Geertjens, L. L. M. G.; Cristian, G.; Ramautar, J. J. R.; Haverman, L.; Schalet, B. B. D.; Linkenkaer-Hansen, K.; van der Wilt, G.-J.; Sprengers, J. J. J.; Bruining, H.

2026-04-23 psychiatry and clinical psychology 10.64898/2026.04.22.26351464 medRxiv
Top 1.0%
0.7%
Show abstract

Progress in pharmacological treatment development for neurodevelopmental disorders is hindered by a misalignment between targeted mechanisms, outcome measures, and trial designs. This study was initiated as a post-trial access pathway for bumetanide and later expanded with treatment-naive participants. Within this framework, we implemented a parent-cocreated sensory outcome measure set (PROMset) in an unmasked, multiple-baseline single-case experimental design with randomized baseline periods of 2-12 weeks, followed by 6 months of bumetanide treatment (up to 1.5 mg twice daily). Participants (7-19 years) had atypical sensory reactivity and a diagnosis of ASD, ADHD, epilepsy, or TSC. The primary outcome was a PROMset comprising seven PROMIS item banks assessing anxiety, depressive symptoms, sleep disturbance, fatigue, sleep-related impairment, cognitive function, and peer relationships. Secondary outcomes included SSP, SRS-2, RBS-R, and ABC. Of 113 enrolled participants (mean age 13.2 [SD 2.7], 64% male), 102 completed the trial and 95 had analyzable PROMsets. At baseline, PROMset scores showed substantial impairment across domains (mean deviation =9.0 T-score points, p<.001) and correlated with sensory reactivity (SSP; r=-0.40, p<.001). Individual-level analyses showed improvement in 24-41% of participants per PROM domain, most frequently in anxiety and depressive symptoms (41% and 38%; mean across-case Cohen's d=-1). Overall, 83% improved on at least one domain. Group-level analyses showed improvement across all secondary outcomes (p<.001), with superiority over historic placebo for RBS-R and SSP. Integrating PROMsets with individualized trial designs can reveal clinically meaningful changes, supporting a more sensitive and patient-centered framework for treatment evaluation in heterogeneous populations.

17
Genetic and Proteomic Investigation of the Smoking-Parkinson Disease Association

Shi, M.; Gunawan, T.; Setzer, M.; Okashah, N.; Liu, Y.; Wingo, T. S.; Wingo, A. P.; Weintraub, D.; Schwarzschild, M. A.; Rentsch, C. T.; Kranzler, H. R.; Gray, J. C.

2026-04-20 genetic and genomic medicine 10.64898/2026.04.17.26351138 medRxiv
Top 1%
0.7%
Show abstract

BackgroundEpidemiological studies show an inverse association between cigarette smoking and Parkinsons disease (PD), suggesting a potential protective effect of smoking on PD incidence, despite the well-established and overwhelming harms of smoking to human health. We integrated genomic and proteomic approaches to investigate the causality and molecular basis of this potential relationship. MethodsWe analyzed summary statistics from genome-wide association studies (GWAS) of smoking initiation (SmkInit), smoking intensity, and PD. Two-sample Mendelian randomization (MR) tested whether genetic liability to smoking behaviors causally influences PD risk. Shared genomic architecture was quantified using MiXeR, and conjunctional false discovery rate (conjFDR) analysis identified loci jointly associated with smoking and PD, which were then mapped to genes and tested for tissue enrichment. To identify mediating proteins, we integrated dorsolateral prefrontal cortex proteomic data with GWAS using proteome-wide association studies (PWAS), summary-based MR, heterogeneity in dependent instruments testing, and colocalization. Finally, the druggability of convergent genes was evaluated. ResultsMR analyses indicated a protective effect of genetic liability to SmkInit on PD risk (OR = 0.78, 95% CI: 0.67-0.91, P = 1.5 x 10-3), which was consistent across sensitivity analyses and not suggestive of directional pleiotropy. However, no significant effect of genetic liability to cigarettes per day (CigDay) on PD risk was found. MiXeR revealed modest polygenic overlap between SmkInit and PD (13.9%; genetic correlation rg = -0.16) and between CigDay and PD (22.9%; rg = -0.09). ConjFDR identified 95 shared loci for SmkInit-PD and 26 for CigDay-PD. SmkInit-PD loci mapped to genes involved in neurotrophic signaling, synaptic organization, microglial modulation, and mitochondrial stress responses, with expression enriched in substantia nigra, basal ganglia, and interconnected cortical regions. PWAS identified 11 proteins shared by PD and SmkInit and 5 shared with CigDay, several of which (AKT3, MAPT, RIT2, EXD2, and PPP3CC) were supported by both genomic and proteomic analyses. Druggability assessment highlighted six proteins with existing pharmacologic modulation potential, spanning neurotrophic, microglial, proteostatic, and ion-channel pathways. ConclusionsGenetic liability to smoking initiation appears to confer modest protection against PD. Integrative genomic and proteomic evidence converges on neurotrophic, synaptic, microglial, and mitochondrial pathways as shared mechanisms, identifying biologically coherent potential therapeutic targets for advancing smoke-free neuroprotective strategies in PD.

18
Leveraging Predictive AI and LLM-Powered Trial Matching to Improve Clinical Trial Recruitment: A Usability Assessment of Trialshub

Blankson, P.-K.; Hussien, S.; Idris, F.; Trevillion, G.; Aslam, A.; Afani, A.; Dunlap, P.; Chepkorir, J.; Melgarejo, P.; Idris, M.

2026-04-20 health informatics 10.64898/2026.04.17.26351107 medRxiv
Top 1%
0.7%
Show abstract

BackgroundRecruitment remains a major barrier to timely clinical trial completion. Trialshub is an LLM-powered, chat-based platform intended to help users identify relevant trials and connect with coordinators to streamline recruitment workflows. ObjectiveTo evaluate the perceived usability and operational value of Trialshub, and identify implementation considerations for real-world deployment. MethodsA usability test was conducted at Morehouse School of Medicine for the Trialshub application. Purposively selected participants included clinical research coordinators and individuals with and without clinical trial search experience. Participants completed a pre-test survey assessing demographics, digital health information behaviors, and familiarity with AI tools, followed by a moderated usability session using a Trialshub prototype. Users completed scenario-based tasks (locating a breast cancer trial, reviewing results, and initiating coordinator contact) using a think-aloud protocol. Task ratings, screen recordings, and transcribed feedback were analyzed descriptively and thematically, and reported. ResultsParticipants reported high comfort with using digital tools and moderate-to-high familiarity with AI. Trialshubs chat-first design, guided prompts, and checklist-style eligibility display were perceived as intuitive and reduced cognitive load. Fast access to trials and the coordinator-contact workflow were viewed positively. Key usability issues included uncertainty at step transitions, insufficient cues for selecting results and next actions, and inconsistent system reliability (loading delays, errors, and broken trial detail pages). Participants also noted redundant questioning due to limited conversational memory, requested improved filtering/sorting, and clearer calls-to-action. All participants indicated that Trialshub has strong potential to meaningfully improve clinical trial processes. ConclusionsTrialshub shows promise for improving trial discovery and recruitment workflows, with identified design implications for real-world deployment.

19
MIMIC-IV-Phenotype-Atlas (MIPA) : A Publicly Available Dataset for EHR Phenotyping

Yamga, E.; Goudrar, R.; Despres, P.

2026-04-24 health informatics 10.64898/2026.04.16.26350888 medRxiv
Top 1%
0.7%
Show abstract

Introduction Secondary use of electronic health records (EHRs) often requires transforming raw clinical information into research-grade data. A central step in this process is EHR phenotyping - the identification of patient cohorts defined by specific medical conditions. Although numerous approaches exist, from ICD-based heuristics to supervised learning and large language models (LLMs), the field lacks standardized benchmark datasets, limiting reproducibility and hindering fair comparison across methods. Methods We developed the MIMIC-IV Phenotype Atlas (MIPA) dataset, an adaptation of MIMIC-IV that provides expert-annotated discharge summaries across 16 phenotypes of varying prevalence and complexity. Two independent clinicians reviewed and labeled the discharge summaries, resolving disagreements by consensus. In parallel, we implemented a processing pipeline that extracts multimodal EHR features and generates training, validation, and testing datasets for supervised phenotyping. To illustrate MIPA's utility, we benchmarked four phenotyping methods : ICD-based classifiers, keyword-driven Term Frequency-Inverse Document Frequency (TF-IDF) classifiers, supervised machine learning (ML) models, and LLMs on the task. Results The final MIPA corpus consists of 1,388 expert-annotated discharge summaries. Annotation reliability was high (mean document-level kappa = 0.805, mean label-level kappa = 0.771), with 91% of disagreements resolved through consensus review. MIPA provides high-quality phenotype labels paired with structured EHR features and predefined train/validation/test splits for each phenotype. In the benchmarking case study, LLMs achieved the highest F1 scores in 13 of 16 phenotypes, particularly for conditions requiring contextual interpretation of clinical narrative, while supervised ML offered moderate improvements over rule-based baselines. Conclusion MIPA is the first publicly available benchmark dataset dedicated to EHR phenotyping, combining expert-curated annotations, broad phenotype coverage, and a reproducible processing pipeline. By enabling standardized comparison across ICD-based heuristics, ML models, and LLMs, MIPA provides a durable reference resource to advance methodological development in automated phenotyping.

20
Drug-Target Mendelian Randomization and Imaging Mediation Analyses Reveal Therapeutic Targets and Causal Mechanisms for Cardiomyopathies

Wang, P.; Song, Y.; Zhang, B.; Yang, J.

2026-04-22 cardiovascular medicine 10.64898/2026.04.20.26351344 medRxiv
Top 1%
0.7%
Show abstract

Abstract Background: Hypertrophic (HCM) and dilated (DCM) cardiomyopathy constitute the principal phenotypes of primary cardiomyopathy, yet both lack sufficient therapeutic options. Integrating genetic insights with detailed cardiac phenotyping offers a promising strategy to prioritize targets and elucidate their mechanisms of action. Methods: We conducted an three-stage analysis. First, drug-target Mendelian randomization (MR) was performed using cis-acting protein (pQTL) and expression (eQTL) quantitative trait loci as genetic instruments for potential drug targets. Second, we examined causal associations between 82 cardiac magnetic resonance (CMR)-derived imaging traits and HCM/DCM risk in a CMR-based MR analysis. Third, mediation MR was employed to quantify the proportion of the genetic effect of prioritized drug targets on cardiomyopathy risk that was mediated through specific CMR phenotypes. Results: Our analyses identified 19 and 13 potential therapeutic targets for HCM and DCM, respectively. CMR-based MR revealed that HCM risk was causally associated with increased right ventricular ejection fraction (RVEF) and greater left ventricular wall thickness, whereas DCM risk was linked to ventricular dilation, impaired myocardial strain, and altered aortic dimensions. Critically, mediation analysis established that these CMR traits served as significant intermediate pathways. The protective effect of ALPK3 on HCM risk was mediated through a reduction in myocardial wall thickness. Conversely, the effects of PDLIM5, HSPA4, and FBXO32 on DCM risk were exerted in part via alterations in aortic dimensions. Conclusion: This integrative genetic and imaging study systematically identify candidate therapeutic targets for HCM and DCM and delineates the specific CMR phenotypes through which they likely exert their causal effects. Our findings advance the understanding of disease pathogenesis and highlight new possibilities for improving the diagnosis and management of cardiomyopathy.